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Abstract 



< 

0^ [ We have created a genome-scale network reconstruction of Escherichia coli metabolism. 

Existing reconstructions were improved in terms of annotation standards, to facilitate 
their subsequent use in dynamic modelling. The resultant network is available from 
EcoliNet ( ,http://ecoh.sf.net/^ . 



EcoliNet 



The structure of metabolic networks can be determined by a reconstruction approach, using 
data from genome annotation, metabolic databases and chemical databases [1]. We built upon 

^ ■ an existing reconstruction of the metabolic network of E. coli that was based on genomic and 

literature data (known as iJ01366, [2J). This model contains 1366 genes, 2251 metabolic 

Q^ I reactions, and 1136 unique metabolites. Comparison to experimental data sets shows that it 

^N ■ makes accurate phenotypic predictions of growth on different substrates and for gene knockout 

■^ . strains |2|. 

C^ ' i JO 1366 suffers from the use of non-standard names and is not annotated with methods that 

are machine-readable. The model was thus updated according to existing community-driven 
annotation standards [3]. The reconstruction is described and made available in Systems 

^ I Biology Markup Language (SBML) (http://sbml.org/, [3]), an established community XML 

H ' format for the mark-up of biochemical models that is understood by a large number of software 



applications. The network is available from EcoliNet (http://ecoli.sf.net/). 



Annotation 

The highly-annotated network is primarily assembled and provided as an SBML file. Specific 
model entities, such as species or reactions, are annotated using ontological terms. These 
annotations, encoded using the resource description framework (RDF) [5j, provide the fa- 
cility to assign definitive terms to individual components, allowing software to identify such 
components unambiguously and thus link model components to existing data resources [6]. 
Minimum Information Requested in the Annotation of Models (MIRIAM, [7|) -compliant 
annotations have been used to identify components unambiguously by associating them with 



one or more terms from publicly available databases registered in MIRIAM resources [8]. 
Thus this network is entirely traceable and is presented in a computational framework. 

Nine different databases are used to annotate entities in the network (see Table [1]). The 
Systems Biology Ontology (SBO) [9] is also used to semantically discriminate between entity 
types. Eight different SBO terms are used to annotate entities in the network (see Table [2]). 
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Table 1: MIRIAM annotations used in the model. 
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SBO term 


interpretation 


cytoplasm 


290 


compartment 


(-)-ureidoglycolate 


247 


metabolite 


tRNA (Glu) 
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ribonucleic acid 


glgB 
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enzyme 


1,4-alpha-glucan branching enzyme 


176 


biochemical reaction 


1,4-alpha-glucan transport 
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transport reaction 


biomass objective function 
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modelling reaction 


glgB -^ 1,4-alpha-glucan branching enzyme 


460 


catalyst 



Table 2: SBO terms used in the model. 



Use 



We maintain the distinction between the E. coli GEnome scale Network REconstruction 
(GENRE) tlOj and its derived GEnome scale Model (GEM) [11] ■ This is important to dif- 
ferentiate between the established biochemical knowledge included in a GENRE and the 
modelling assumptions required for analysis or simulation with a GEM. A GENRE serves as 
a structured knowledge base of established biochemical facts, while a GEM is a model which 
supplements the established biochemical information with additional (potentially hypothet- 
ical) information to enable computational simulation and analysis [12] . Reactions added to 
the GEM include the biomass objective function - a sink representing cellular growth - and 
hypothetical transporters. 

Three versions of the network are made available: 



<organism>_<version>.xml, a GEM for use in flux analyses, provided in Flux Balance 
Constraints (FBC) format (13j 

<organism>_<version>_cobra.xml, the same GEM network, provided in Cobra for- 
mat [E 



<organism>_<version>_recon.xml, a GENRE containing only reactions for which there 
is experimental evidence 



YeastNet 

YeastNet is an annotated metabolic network of Saccharomyces cerevisiae S288c that is pe- 
riodically updated by a team of collaborators from various research groups. It started on 
the shoulders of previous reconstructions of the yeast metabolic network that were published 
separately (iLL672 [15] and iMM904 [I6]). However, due to the different approaches utilised, 
those earlier reconstructions had a significant number of differences. A community effort 
in 2007 resulted in a consensus network representation of yeast metabolism, reconciling the 
earlier results. 

As of December 2012, six versions of the network have been released (see Table [3]). 



version 


date 


publications 
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February 2008 


la 


2 


June 2009 


- 
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October 2009 
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m 


6 
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- 



Table 3: Development of YeastNet 



The EcoliNet and YeastNet networks are structured identically to facilitate comparative stud- 
ies. YeastNet is available from |http:/ /yeast. sf.net/ 
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